56 



f 



SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT : Lcbel, Peter 

Sleac , David S . 

(ii) TITLE Or INVENTION: NOVEL HUMAN LYSOSOMAL PROTEIN AND 
METHODS OF ITS USE 

(iii) NUMBER OF SEQUENCES: 12 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: David A. Jackson, Esq. 

(B) STREET: 411 Hackensack Ave, Continental Plaza, 4th 

Floor 

(C) CITY: Hackensack 

(D) STATE: New Jersey 

(E) COUNTRY: USA 

(F) ZIP: 07601 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Jackson Esq., David A. 

(B) REGISTRATION NUMBER: 26,742 

(C) REFERENCE /DOCKET NUMBER: 601-1-077 



57 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: 201-437-5800 
(3) TELEFAX: 2 01-3-43-1584 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3437 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

CGCGGAAGGG CAGAATGGGA CTCCAAGCCT GCCTCCTAGG GCTCTTTGCC CTCATCCTCT 60 

CTGGCAAATG CAGTTACAGC CCGGAGCCCG ACCAGCGGAG GACGCTGCCC CCAGGCTGGG 12 0 

TGTCCCTGGG CCGTGCGGAC CCTGAGGAAG AGCTGAGTCT CACCTTTGCC CTGAGACAGC 180 

AGAATGTGGA AAGACTCTCG GAGCTGGTGC AGGCTGTGTC GGATCCCAGC TCTCCTCAAT 240 

ACGGAAAATA CCTGACCCTA GAGAATGTGG CTGATCTGGT GAGGCCATCC CCACTGACCC 300 

TCCACACGGT GCAAAAATGG CTCTTGGCAG CCGGAGCCCA GAAGTGCCAT TCTGTGATCA 360 

CACAGGACTT TCTGACTTGC TGGCTGAGCA TCCGACAAGC AGAGCTGCTG CTCCCTGGGG 420 

CTGAGTTTCA TCACTATGTG GGAGGACCTA CGGAAACCCA TGTTGTAAGG TCCCCACATC 48 0 
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CCTACCAGCT TCCACAGGCC TTGGCCCCCC ATGTGGACTT TGTGGGGGGA CTGCACCATT 54 0 

TTCCCCCAAC ATCATCCCTG AGGCAACGTC CTGAGCCGCA GGTGACAGGG ACTGTAGGCC 600 

TGCATCTGGG GGTAACCCCC TCTGTGATCC GTAAGCGATA CAACTTGACC TCACAAGACG 660 

TGGGCTCTGG CACCAGCAAT AACAGCCAAG CCTGTGCCCA GTTCCTGGAG CAGTATTTCC 72 0 

ATGACTCAGA CCTGGCTCAG TTCATGCGCC TCTTCGGTGG CAACTTTGCA CATCAGGCAT 78 0 

CAGTAGCCCG TGTGGTTGGA CAACAGGGCC GGGGCCGGGC CGGGATTGAG GCCAGTCTAG 84 0 

ATGTGCAGTA CCTGATGAGT GCTGGTGCCA ACATCTCCAC CTGGGTCTAC AGTAGCCCTG 900 

GCCGGCATGA GGGACAGGAG CCCTTCCTGC AGTGGCTCAT GCTGCTCAGT AATGAGTCAG 960 

CCCTGCCACA TGTG CAT ACT GTGAGCTATG GAGATGATGA GGACTCCCTC AGCAGCGCCT 1020 

ACATCCAGCG GGTCAACACT GAGCTCATGA AGGCTGCTGC TCGGGGTCTC ACCCTGCTCT 1080 

TCGCCTCAGG TGACAGTGGG GCCGGGTGTT GGTCTGTCTC TGGAAGACAC CAGTTCCGCC 114 0 

CTACCTTCCC TGCCTCCAGC CCCTATGTCA CCACAGTGGG AGGCACATCC TTCCAGGAAC 1200 

CTTTCCTCAT CACAAATGAA ATTGTTGACT ATATCAGTGG TGGTGGCTTC AG CAATGTGT 1260 

TCCCACGGCC TTCATACCAG GAGGAAGCTG TAACGAAGTT CCTGAGCTCT AGCCCCCACC 1320 

TGCCACCATC CAGTTACTTC AATGCCAGTG GCCGTGC CTA CCCAGATGTG GCTGCACTTT 138 0 

CTGATGGCTA CTGGGTGGTC AGCAACAGAG TGCCCATTCC ATGGGTGTCC GGAACCTCGG 1440 

CCTCTACTCC AGTGTTTGGG GGGATCCTAT CCTTGATCAA TGAGCACAGG ATCCTTAGTG 1500 

GCCGCCCCCC TCTTGGCTTT CTCAACCCAA GGCTCTACCA GCAGCATGGG GCAGGACTCT 1560 

TTGATGTAAC CCGTGGCTGC CATGAGTCCT GTCTGGATGA AGAGGTAGAG GGCCAGGGTT 1620 
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TCTGCTCTGG TCCTGGCTGG GATCCTGTAA CAGGCTGGGG AACACCCAAC TTCCCAGCTT 1680 

TGCTGAAGAC TCTACTCAAC CCCTGACCCT TTCCTATCAG G AG AG ATGG C TTGTCCCCTG 174 0 

CCCTGAAGCT GGCAGTTCAG TCCCTTATTC TGCCCTGTTG GAAGCCCTGC TGAACCCTCA 18 00 

ACTATTGACT GCTGCAG AC A GCTTATCTCC CTAACCCTGA AATGCTGTGA GCTTGACTTG 186 0 

ACTCCCAACC CTACCATGCT C CAT CAT ACT CAGGTCTCCC TACTCCTGCC TT AG ATT C CT 192 0 

CAATAAGATG CTGTAACTAG CATTTTTTGA ATGCCTCTCC CTCCGCATCT CATCTTTCTC 1980 

TTTTCAATCA GGCTTTTCCA AAGGGTTGTA TACAGACTCT GTGCACTATT TCACTTGATA 2 04 0 

TTCATTCCCC AATTCACTGC AAGGAGACCT CTACTGTCAC CGTTTACTCT TTCCTACCCT 210 0 

GACATCCAGA AACAATGGCC TCCAGTGCAT ACTTCTCAAT CTTTGCTTTA TGGCCTTTCC 2160 

ATCATAGTTG CCCACTCCCT CTCCTTACTT AGCTTCCAGG TCTTAACTTC TCTGACTACT 2 22 0 

CTTGTCTTCC TCTCTCATCA ATTTCTGCTT CTTCATGGAA TGCTGACCTT CATTGCTCCA 228 0 

TTTGTAGATT TTTGCTCTTC T C AGTTT ACT CATTGTCCCC TGGAACAAAT CACTGACATC 2 34 0 

TACAACCATT ACCATCTCAC TAAATAAGAC TTTCTATCCA ATAATGATTG ATACCTCAAA 240 0 

TGTAAGATGC GTGATACTCA ACATTTCATC GTCCACCTTC CCAACCCCAA ACAATTCCAT 246 0 

CTCGTTTCTT CTTGGTAAAT GATGCTATGC TTTTTCCAAC CAAGCCAGAA ACCTGTGTCA 2 52 0 

TCTTTTCACC CCACCTTCAA TCAACAAGTC CTCAATCAAC AAGTC CTACT GACTGCACAT 2 58 0 

CTTAAATATA TCTTTATCAG TCCACAAGTC CTTCCAATTA TATTTCCCAA GTATATCTAG 264 0 

AACTTATCCA CTTATATCCC CACTGCTACT ACCTTAGTTT AGGGCTATAT TCTCTTGAAA 2 700 

AAAAGTGTCC TTACTTCCTG CCAATCCCCA AGTCATCTTC CAGAGTAAAA TGCAAATCCC 2 760 
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ATCAGGCCAC TTGGATGAAA AC CCTTCAAG GATTACTGGA TAGAATTCAG GCTTTCCCCT 
CCASCCCCCA ATCATAGCTC ACAAACCTTC CTTGCTATTT GTTCTTAAGT AAAAAATCAT 
TTTTCCTCCT CCCTCCCCAA ACCCCAAGGA ACTCTCACTC TTGCTCAAGC TGTTCCGTCC 
CCTTACCACC CCTGATACAA CTGCCAGGTT AATTT CCAGA ATTCTTGCAA GACTCAGTTC 
AGAAGTCACC TTCTTTCGTG AATGTTTTGA TTCCCTGAGG CTACTTTATT TTGGTATGGC 
TGAAAAATCC TAGATTTTCT AAACAAAACC TGTTTGAATC TTGGTTCTGA TATGGACTAG 
GAGAGAGACT GGGTCAAGTA AGCTTATCTC CCTGAGGCTG TTTCCTCGTC TGTTAAGTGT 
GAATATCAAT ACCTGCCTTT CATAATCACC AGGGAATAAA GTGGAATAAT GTTGATAACA 
GTGCTTGGCA CCTGGAAGTA GGTGGCAGAT GTTAACGCCC TTCCTCCCTT GCACTGCGCC 
CCCTGTGCCT ACCTCTAGCA TTGTAACGAC CACATAGTAT TGAAATGGCC AGTTTACTTG 
TCTGCCTTCC TTTCCAAGAC CGTTGGTGCC TAGAGGACTA GAATCGTGTC CTATTTAACT 
TTGTGTTCCC AGGTCCTAGC TCAGGAGTTG GCAAATAAGA ATTAAATGTC TGCTACACCG 
AAACAAA 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2520 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

CGCGGAAGGG CAGAATGGGA CTCCAAGCCT GCCTCCTAGG GCTCTTTGCC CTCATCCTCT 6 0 

CTGGCAAATG CAGTTACAGC CCGGAGCCCG AC C AGCGG AG GACGCTGCCC CCAGGCTGGG 120 

TGTCCCTGGG CCGTGCGGAC CCTGAGGAAG AGCTGAGTCT CACCTTTGCC CTGAGACAGC 18 0 

AGAATGTGGA AAGACTCTCG GAGCTGGTGC AGGCTGTGTC GGATCCCAGC TCTCCTCAAT 240 

ACGGAAAATA CCTGACCCTA GAGAATGTGG CTGATCTGGT GAGGC CATCC CCACTGACCC 300 

TCCACACGGT GCAAAAATGG CTCTTGGCAG CCGGAGCCCA GAAGTGCCAT TCTGTGATCA 36 0 

CACAGGACTT TCTGACTTGC TGGCTGAGCA TCCGACAAGC AGAGCTGCTG CTCCCTGGGG 420 

CTGAGTTTCA TCACTATGTG GGAGGACCTA CGGAAACCCA TGTTGTAAGG TCCCCACATC 480 

CCTACCAGCT TCCACAGGCC TTGGCCCCCC ATGTGGACTT TGTGGGGGGA CTGCACCATT 540 

TTCCCCCAAC ATCATCCCTG AGGCAACGTC CTGAGCCGCA GGTGACAGGG ACTGTAGGCC 600 

TGCATCTGGG GGTAACCCCC TCTGTGATCC GTAAGCGATA CAACTTGACC TCACAAGACG 660 

TGGGCTCTGG CACCAGCAAT AACAGCCAAG CCTGTGCCCA GTTCCTGGAG CAGTATTTCC 72 0 

ATGACTCAGA CCTGGCTCAG TTCATGCGCC TCTTCGGTGG CAACTTTGCA CATCAGGCAT 780 

CAGTAGCCCG TGTGGTTGGA CAACAGGGCC GGGGCCGGGC CGGGATTGAG GCCAGTCTAG 840 

ATGTGCAGTA CCTGATGAGT GCTGGTGCCA ACATCTCCAC CTGGGTCTAC AGTAGCCCTG 900 

GCCGGCATGA GGGACAGGAG CCCTTCCTGC AGTGGCTCAT GCTGCTCAGT AATGAGTCAG 960 
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CCCTGCCACA TGTGCATACT GTGAGCTATG GAGATGATGA GGACTCCCTC AGCAGCGCCT 1020 

ACATCCAGCG GGTCAACACT GAGCTCATGA AGGCTGCTGC TCGGGGTCTC ACCCTGCTCT 1080 

TCGCCTCAGG TGACAGTGGG GCCGGGTGTT GGTCTGTCTC TGGAAGACAC CAGTTCCGCC 1140 

CTACCTTCCC TGCCTCCAGC CCCTATGTCA CCACAGTGGG AGGCACATCC TTCCAGGAAC 12 0 0 

CTTTCCTCAT CACAAATGAA ATTGTTGACT ATATCAGTGG TGGTGGCTTC AGCAATGTGT 126 0 

TCCCACGGCC TTCATACCAG GAGGAAGCTG TAACGAAGTT CCTGAGCTCT AGCCCCCACC 1320 

TGCCACCATC CAGTTACTTC AATGCCAGTG GCCGTGCCTA CCCAGATGTG GCTGCACTTT 1380 

CTGATGGCTA CTGGGTGGTC AGCAACAGAG TGCCCATTCC ATGGGTGTCC GGAACCTCGG 144 0 

CCTCTACTCC AGTGTTTGGG GGGATCCTAT CCTTGATCAA TGAGCACAGG ATCCTTAGTG 1500 

GCCGCCCCCC TCTTGGCTTT CTCAACCCAA GGCTCTACCA GCAGCATGGG GCAGGACTCT 1560 

TTGATGTAAC CCGTGGCTGC CATGAGTCCT GTCTGGATGA AGAGGTAGAG GGCCAGGGTT 1620 

TCTGCTCTGG TCCTGGCTGG GATCCTGTAA CAGGCTGGGG AACACCCAAC TTCCCAGCTT 16 80 

TGCTGAAGAC TCTACTCAAC CCCTGACCCT TTCCTATCAG GAGAGATGGC TTGTCCCCTG 1740 

CCCTGAAGCT GGCAGTTCAG TCCCTTATTC TGCCCTGTTG GAAGCCCTGC TGAACCCTCA 18 00 

ACTATTGACT GCTGCAGACA GCTTATCTCC CTAACCCTGA AATGCTGTGA GCTTGACTTG 186 0 

ACTCCCAACC CTACCATGCT C CAT CAT ACT CAGGTCTCCC TACTCCTGCC TTAGATTCCT 192 0 

CAATAAGATG CTGTAACTAG CATTTTTTGA ATGCCTCTCC CTCCGCATCT CATCTTTCTC 1980 

TTTTCAATCA GGCTTTTCCA AAGGGTTGTA TACAGACTCT GTGCACTATT TCACTTGATA 2 040 

TTCATTCCCC AATTCACTGC AAGG AG AC CT CTACTGTCAC CGTTTACTCT TTCCTACCCT 2100 
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GACATCCAGA AACAATGGCC TCCAGTGCAT ACTTCTCAAT CTTTGCTTTA TGGCCTTTCC 2160 

ATCATAGTTG CCCACTCCCT CT C CTTACTT AGCTTCCAGG TCTTAACTTC TCTGACTACT 2 2 20 

CTTGTCTTCC TCTCTCATCA ATTTCTGCTT CTTCATGGAA TGCTGACCTT CATTG CTCCA 22 8 0 

TTTGTAGATT TTTGCTCTTC TCAGTTTACT CATTGTCCCC TGGAACAAAT CACTGACATC 234 0 

TACAACCATT ACCATCTCAC TAAATAAGAC TTTCTATCCA ATAATGATTG AT AC CT C AAA 24 00 

TGTAAGATGC GTGATACTCA ACATTTCATC GTCCACCTTC CCAACCCCAA ACAATTC CAT 2460 

CTCGTTTCTT CTTGGTAAAT GATGCTATGC TTTTTCCAAC CAAAAAAAAA AAAAAAAAAA 252 0 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Arg Lys Gly Arg Met Gly Leu Gin Ala Cys Leu Leu Gly Leu Phe Ala 
15 10 15 

Leu lie Leu Ser Gly Lys Cys Ser Tyr Ser Pro Glu Pro Asp Gin Arg 
20 25 30 
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Arg Thr Leu Pro Pro Gly Trp Val Ser Leu Gly Arg Ala Asp Pro Glu 
35 40 45 

Glu Glu Leu Ser Leu Thr Phe Ala Leu Arg Gin Gin Asn Val Glu Arg 
50 55 60 

Leu Ser Glu Leu Val Gin Ala Val Ser Asp Pro Ser Ser Pro Gin Tyr 
65 70 75 80 

Gly Lys Tyr Leu Thr Leu Glu Asn Val Ala Asp Leu Val Arg Pro Ser 
85 90 95 

Pro Leu Thr Leu His Thr Val Gin Lys Trp Leu Leu Ala Ala Gly Ala 
100 105 no 

Gin Lys Cys His Ser Val lie Thr Gin Asp Phe Leu Thr Cys Trp Leu 
115 120 125 

Ser lie Arg Gin Ala Glu Leu Leu Leu Pro Gly Ala Glu Phe His His 
130 135 140 

Tyr Val Gly Gly Pro Thr Glu Thr His Val Val Arg Ser Pro His Pro 
145 150 155 160 

Tyr Gin Leu Pro Gin Ala Leu Ala Pro His Val Asp Phe Val Gly Gly 
165 170 175 

Leu His His Phe Pro Pro Thr Ser Ser Leu Arg Gin Arg Pro Glu Pro 
180 185 190 

Gin Val Thr Gly Thr Val Gly Leu His Leu Gly Val Thr Pro Ser Val 
195 200 205 

lie Arg Lys Arg Tyr Asn Leu Thr Ser Gin Asp Val Gly Ser Gly Thr 
210 215 220 



Ser Asn Asn Ser Gin Ala Cys Ala Gin Phe Leu Glu Gin Tyr Phe His 
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225 230 235 240 

Asp Ser Asp Leu Ala Gin Phe Met Arg Leu Phe Gly Gly Asn Phe Ala 
245 250 255 

His Gin Ala Ser Val Ala Arg Val Val Gly Gin Gin Gly Arg Gly Arg 
260 265 270 

Ala Gly lie Glu Ala Ser Leu Asp Val Gin Tyr Leu Met Ser Ala Gly 
275 280 285 

Ala Asn lie Ser Thr Trp Val Tyr Ser Ser Pro Gly Arg His Glu Gly 
290 295 300 

Gin Glu Pro Phe Leu Gin Trp Leu Met Leu Leu Ser Asn Glu Ser Ala 
305 310 315 320 

Leu Pro His Val His Thr Val Ser Tyr Gly Asp Asp Glu Asp Ser Leu 
325 330 335 

Ser Ser Ala Tyr lie Gin Arg Val Asn Thr Glu Leu Met Lys Ala Ala 
340 345 350 

Ala Arg Gly Leu Thr Leu Leu Phe Ala Ser Gly Asp Ser Gly Ala Gly 
355 360 365 

Cys Trp Ser Val Ser Gly Arg His Gin Phe Arg Pro Thr Phe Pro Ala 
370 375 380 

Ser Ser Pro Tyr Val Thr Thr Val Gly Gly Thr Ser Phe Gin Glu Pro 
385 390 395 400 

Phe Leu lie Thr Asn Glu lie Val Asp Tyr lie Ser Gly Gly Gly Phe 
405 410 415 



Ser Asn Val Phe Pro Arg Pro Ser Tyr Gin Glu Glu Ala Val Thr Lys 
420 425 430 
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Phe Leu Ser Ser Ser Pro His Leu Pro Pro Ser Ser Tyr Phe Asn Ala 
435 440 445 

Ser Gly Arg Ala Tyr Pro Asp Val Ala Ala Leu Ser Asp Gly Tyr Trp 
450 455 460 

Val Val Ser Asn Arg Val Pro lie Pro Trp Val Ser Gly Thr Ser Ala 
465 470 475 480 

Ser Thr Pro Val Phe Gly Gly He Leu Ser Leu He Asn Glu His Arg 
485 490 495 

He Leu Ser Gly Arg Pro Pro Leu Gly Phe Leu Asn Pro Arg Leu Tyr 
500 505 510 

Gin Gin His Gly Ala Gly Leu Phe Asp Val Thr Arg Gly Cys His Glu 
515 520 525 

Ser Cys Leu Asp Glu Glu Val Glu Gly Gin Gly Phe Cys Ser Gly Pro 
530 535 540 

Gly Trp Asp Pro Val Thr Gly Trp Gly Thr Pro Asn Phe Pro Ala Leu 
545 550 555 560 

Leu Lys Thr Leu Leu Asn Pro 
565 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Lys Ser Ser Ala Ala Lys Gin Thr Val Leu Cys Leu Asn Arg Tyr 
1 5 10 15 

Ala Val Val Ala Leu Pro Leu Ala lie Ala Ser Phe Ala Ala Phe Gly 
20 25 30 

Ala Ser Pro Ala Ser Thr Leu Trp Ala Pro Thr Asp Thr Lys Ala Phe 
35 40 45 

Val Thr Pro Ala Gin Val Glu Ala Arg Ser Ala Ala Pro Leu Leu Glu 
50 55 60 

Leu Ala Ala Gly Glu Thr Ala His lie Val Val Ser Leu Lys Leu Arg 
65 70 75 80 

Asp Glu Ala Gin Leu Lys Gin Leu Ala Gin Ala Val Asn Gin Pro Gly 
85 90 95 

Asn Ala Gin Phe Gly Lys Phe Leu Lys Arg Arg Gin Phe Leu Ser Gin 
100 105 110 

Phe Ala Pro Thr Glu Ala Gin Val Gin Ala Val Val Ala His Leu Arg 
115 120 125 

Lys Asn Gly Phe Val Asn lie His Val Val Pro Asn Arg Leu Leu lie 
130 135 140 

Ser Ala Asp Gly Ser Ala Gly Ala Val Lys Ala Ala Phe Asn Thr Pro 
145 150 155 160 
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Leu Val Arg Tyr Gin Leu Asn Gly Lys Ala Gly Tyr Ala Asn Thr Ala 
155 170 175 

Pro Ala Gin Val Pre Gin Asp Leu Gly Glu lie Val Gly Ser Val Leu 
130 135 190 

Gly Leu Gin Asn Val Thr Arg Ala His Pro Met Leu Lys Val Gly Glu 
135 200 205 

Arg Ser Ala Ala Lys Thr Leu Ala Ala Gly Thr Ala Lys Gly His Asn 
210 215 220 

Pro Thr Glu Phe Pro Thr lie Tyr Asp Ala Ser Ser Ala Pro Thr Ala 
225 230 235 240 

Ala Asn Thr Thr Val Gly lie lie Thr lie Gly Gly Val Ser Gin Thr 
245 250 255 

Leu Gin Asp Leu Gin Gin Phe Thr Ser Ala Asn Gly Leu Ala Ser Val 
260 265 270 

Asn Thr Gin Thr lie Gin Thr Gly Ser Ser Asn Gly Asp Tyr Ser Asp 
275 280 285 

Asp Gin Gin Gly Gin Gly Glu Trp Asp Leu Asp Ser Gin Ser lie Val 
290 295 300 

Gly Ser Ala Gly Gly Ala Val Gin Gin Leu Leu Phe Tyr Met Ala Asp 
305 310 315 320 

Gin Ser Ala Ser Gly Asn Thr Gly Leu Thr Gin Ala Phe Asn Gin Ala 
325 330 335 

Val Ser Asp Asn Val Ala Lys Val lie Asn Val Ser Leu Gly Trp Cys 
340 345 350 

Glu Ala Asp Ala Asn Ala Asp Gly Thr Leu Gin Ala Glu Asp Arg lie 
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355 360 365 

Phe Ala Thr Ala Ala Ala Gin Gly Gin Thr Phe Ser Val Ser Ser Gly 
370 375 380 

Asp Glu Gly Val Tyr Glu Cys Asn Asn Arg Gly Tyr Pro Asp Gly Ser 
385 390 395 400 

Thr Tyr Ser Val Ser Trp Pro Ala Ser Ser Pro Asn Val lie Ala Val 
405 410 415 

Gly Gly Thr Thr Leu Tyr Thr Thr Ser Ala Gly Ala Tyr Ser Asn Glu 
420 425 430 

Thr Val Trp Asn Glu Gly Leu Asp Ser Asn Gly Lys Leu Trp Ala Thr 
435 440 445 

Gly Gly Gly Tyr Ser Val Tyr Glu Ser Lys Pro Ser Trp Gin Ser Val 
450 455 460 

Val Ser Gly Thr Pro Gly Arg Arg Leu Leu Pro Asp lie Ser Phe Asp 
465 470 475 480 

Ala Ala Gin Gly Thr Gly Ala Leu lie Tyr Asn Tyr Gly Gin Leu Gin 
485 490 495 

Gin lie Gly Gly Thr Ser Leu Ala Ser Pro lie Phe Val Gly Leu Trp 
500 505 510 

Ala Arg Leu Gin Ser Ala Asn Ser Asn Ser Leu Gly Phe Pro Ala Ala 
515 520 525 

Ser Phe Tyr Ser Ala lie Ser Ser Thr Pro Ser Leu Val His Asp Val 
530 535 540 



Lys Ser Gly Asn Asn Gly Tyr Gly Gly Tyr Gly Tyr Asn Ala Gly Thr 
545 550 555 560 
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Gly Trp Asp Tyr Pro Thr Gly Trp Gly Ser Leu Asp lie Ala Lys Leu 
565 570 575 

Ser Ala Tyr He Arg Ser Asn Gly Phe Gly His 
530 585 

) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 5 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Met Lys He Glu Lys Thr Ala Leu Thr Val Ala lie Ala Leu Ala Met 
15 10 15 

Ser Ser Leu Ser Ala His Ala Glu Asp Ala Trp Val Ser Thr His Thr 
20 25 30 

Gin Ala Ala Met Ser Pro Pro Ala Ser Thr Gin Val Leu Ala Ala Ser 
35 40 45 

Ser Thr Ser Ala Thr Thr Thr Gly Asn Ala Tyr Thr Leu Asn Met Thr 
50 55 60 

Gly Ser Pro Arg He Asp Gly Ala Ala Val Thr Ala Leu Glu Ala Asp 
65 70 75 80 
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His Pro Leu His Val Glu Val Ala Leu Lys Leu Arg Asn Pro Asp Ala 
85 90 95 

Leu Gin Thr Phe Leu Ala Gly Val Thr Thr Pro Gly Ser Ala Leu Phe 
100 105 110 

Gly Lys Phe Leu Thr Pro Ser Gin Phe Thr Glu Arg Phe Gly Pro Thr 
115 120 125 

Gin Ser Gin Val Asp Ala Val Val Ala His Leu Gin Gin Ala Gly Phe 
130 135 140 

Thr Asn lie Glu Val Ala Pro Asn Arg Leu Leu lie Ser Ala Asp Gly 
145 150 155 160 

Thr Ala Gly Ala Ala Thr Asn Gly Phe Arg Thr Ser lie Lys Arg Phe 
165, 170 175 

Ser Ala Asn Gly Arg Glu Phe Phe Ala Asn Asp Ala Pro Ala Leu Val 
180 185 190 

Pro Ala Ser Leu Gly Asp Ser Val Asn Ala Val Leu Gly Leu Gin Asn 
195 200 205 

Val Ser Val Lys His Thr Leu His His Val Tyr His Pro Glu Asp Val 
210 215 220 

Thr Val Pro Gly Pro Asn Val Gly Thr Gin Ala Ala Ala Ala Val Ala 
225 230 235 240 

Ala His His Pro Gin Asp Phe Ala Ala lie Tyr Gly Gly Ser Ser Leu 
245 250 255 

Pro Ala Ala Thr Asn Thr Ala Val Gly lie He Thr Trp Gly Ser He 
260 265 270 

Thr Gin Thr Val Thr Asp Leu Asn Ser Phe Thr Ser Gly Ala Gly Leu 
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280 
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Ala Thr Val Asn Ser 
290 

Asn Asp Pro Asp Ser 
305 

Val Gly lie Ala Gly 
325 

Asn Gly Asp Ser Ser 
340 

Ser Tyr Asn Arg Ala 
355 

Ser Leu Gly Glu Asp 
370 

Ala Asp Asp Ala lie 
385 

Ser lie Ala Ser Gly 
405 

Thr Ser Gly Ser Pro 
420 

lie Asp Leu Thr His 
435 

Val lie Gin Val Gly 
450 



Thr lie Thr Lys Val Gly 
295 

Asn Gly Glu Trp Ser Leu 
310 315 

Gly Val Lys Gin Leu lie 
330 

Ser Ser Gly lie Thr Asp 
345 

Val Thr Asp Asn lie Ala 
360 

Glu Thr Ala Ala Gin Gin 
375 

Phe Gin Gin Ala Val Ala 
390 395 

Asp Ala Gly Val Tyr Gin 
410 

Gly Tyr Val Ala Asn Ser 
425 

Tyr Ser Val Ser Glu Pro 
440 

Gly Thr Thr Leu Ser Thr 
455 



Ser Gly Thr Phe Ala 
300 

Asp Ser Gin Asp lie 
320 

Phe Tyr Thr Ser Ala 
335 

Ala Gly lie Thr Ala 
350 

Lys Leu lie Asn Val 
365 

Ser Gly Thr Gin Ala 
380 

Gin Gly Gin Thr Phe 
400 

Trp Ser Thr Asp Pro 
415 

Ala Gly Thr Val Lys 
430 

Ala Ser Ser Pro Tyr 
445 

Ser Gly Thr Thr Trp 
460 



Ser Gly Glu Thr Val Trp Asn Glu Gly Leu Ser 
465 470 475 



Ala lie Ala Pro 



Ser 
480 
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Gin Gly Asp Asn Asn Gin Arg Leu Trp Ala Thr Gly Gly Gly Val Ser 
485 490 495 

Leu Tyr Glu Ala Ala Pro Ser Trp Gin Ser Ser Val Ser Ser Ser Thr 
500 505 510 

Lys Arg Val Gly Pro Asp Leu Ala Phe Asp Ala Ala Ser Ser Ser Gly 
515 520 525 

Ala Leu lie Val Val Asn Gly Ser Thr Glu Gin Val Gly Gly Thr Ser 
530 535 540 

Leu Ala Ser Pro Leu Phe Val Gly Ala Phe Ala Arg lie Glu Ser Ala 
545 550 555 560 

Ala Asn Asn Ala lie Gly Phe Pro Ala Ser Lys Phe Tyr Gin Ala Phe 
565 570 575 

Pro Thr Gin Thr Ser Leu Leu His Asp Val Thr Ser Gly Asn Asn Gly 
580 585 590 

Tyr Gin Ser His Gly Tyr Thr Ala Ala Thr Gly Phe Asp Glu Ala Thr 
595 600 605 

Gly Phe Gly Ser Phe Asp lie Gly Lys Leu Asn Thr Tyr Ala Gin Ala 
610 615 620 

Asn Trp Val Thr Gly Gly Gly Gly Gly Ser Thr 
625 630 635 

INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotides* 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO ; 6 : 
GTGATCACAG AATGGCACTT 
(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotides" 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
AACATGGGTT TCCGTAGGTC 
(2) INFORMATION FOR SEQ ID NO : 8 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION : /desc = "Oligonucleotides" 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
CTTCCTCAGG GTCCGCACGG 
(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotides" 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
TGTAAAACGA CGGCCAGTCA GACCTTCCAG TAGGGACC 
(2) INFORMATION FOR SEQ ID NO:10: 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 3 base pairs 
(3) TYPE: nucleic acid 

(C) ST HANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE : other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotides " 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CAGGAAACAG CTATGACCCT GTATCCCACA CAAGAGAT 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotides" 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TGTAAAACGA CGGCCAGTTA GATGCCATTG GGGACTGG 
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(2) INFORMATION FOR SEQ ID NO : 12 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotides" 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 
CAGGAAACAG CTATGACCGT CATGGAAATA CTGCTCCA 



